OPEN 3 ACCESS Freely available online 



•0-PLOS I ONE 



Nucleotide Diversity of Maize ZmBTI Gene and (S\ 
Association with Starch Physicochemical Properties c ^^ k 

Shuhui Xu 1 , Zefeng Yang 1 , Enying Zhang 1 ' 2 , Ying Jiang 1 , Liang Pan 1 , Qing Chen 1 , Zhengwen Xie 1 , 
Chenwu Xu 1 

1 Jiangsu Key Laboratory of Crop Genetics and Physiology/Co-lnnovation Center for Modern Production Technology of Grain Crops, Key Laboratory of Plant Functional 
Genomics of the Ministry of Education, Yangzhou University, Yangzhou, China, 2 College of Agronomy and Plant Protection, Qingdao Agricultural University, Qingdao, 
Shandong, China 



Abstract 

Cereal Brittlel protein has been demonstrated to be involved in the ADP-GIc transport into endosperm plastids, and plays 
vital roles in the biosynthesis of starch. In this study, the genomic sequences of the ZmBTI gene in 80 elite maize inbred 
lines were obtained, and the nucleotide polymorphisms and haplotype diversity were detected. A total of 30 variants, 
including 22 SNPs and 8 indels, were detected from the full sequences of this gene. Among these polymorphic sites, 9 SNPs 
and 2 indels were found to be located in the coding region. The polymorphisms of CDS sequences classified the maize 
ZmBTI gene into 6 haplotypes, which encode 6 different ZmBTI proteins. Neutrality tests revealed a decrease in population 
size and/or balancing selection on the maize ZmBTI locus. To detect the association between sequence variations of this 
gene and the starch physicochemical properties, 7 pasting and 4 gelatinization traits of starch were measured for the tested 
inbred lines using rapid visco analyzer (RVA) and differential scanning calorimeter (DSC), respectively. The result of 
association analysis revealed that an indel in the coding region was significantly associated with the phenotypic variation of 
starch gelatinization enthalpy. 
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Introduction 

Starch or amylum is a carbohydrate consisting of a large 
number of glucose units joined by glycosidic bonds. Starch rich 
crops are the main source of dietary energy for the world's 
population. It has been believed that plant species share the 
evolutionarily conserved pathway of starch biosynthesis starting 
from the carbon dioxide fixation, followed by transitory starch 
degradation, sucrose synthesis, and starch synthesis in the storage 
organs [1]. Four classes of enzymes are included in the starch 
biosynthesis pathway, they are ADP-glucose pyrophosphorylase 
(AGPase), starch synthase (SS), starch branching enzyme (SBE), 
and starch debranching enzyme (DBE) [2,3]. Among them, 
AGPase catalyzes the first committed and rate-limiting step in this 
pathway, and plays vital role in the biosynthesis of starch [4]. 
Under the catalyzing of AGPase, ADP-glucose (ADP-GIc) is 
synthesized in the cytosol of cereal endosperms as the main 
precursor for starch synthesis and has to be subsequently imported 
into the storage plastids [4] . Because of the importance of cereals 
in the production of storage starches for human diet and other 
industrial usage, the activity of the ADP-GIc transporter have 
gained many attentions as a key component of the starch 
biosynthesis pathway [5]. 



One of the ADP-GIc transporters demonstrated clearly in cereal 
is the protein Brittlel. Brittlel proteins are plant nucleotide 
transporters involved in the mitochondrial carrier family (MCF) 
[6]. The proteins in MCF family transports nucleotides, amino 
acids, inorganic ions, fatty acids, keto acids and cofactors across 
the mitochondrial membrane [7]. Physiological researches on the 
maize Brittlel mutant have revealed that ZmBTI (Zea mays 
Brittlel protein) was involved in the ADP-GIc transport into 
endosperm plastids, and played critical roles in the biosynthesis of 
starch [4,8,9]. The maize endosperm with bll mutant is severely 
reduced in starch content, which results in kernels with a collapsed 
angular appearance at maturity [4] . The amyloplasts from young 
kernels isolated from endosperms with btl mutant were only 25% 
as active in ADP-GIc uptake and conversion to starch as 
amyloplasts from normal and mutant maize endosperms, suggest- 
ing that ZmBTI is involved in the transport of ADP-GIc into 
maize endosperm plastids [8,10]. The researches in other cereals 
also revealed that the homologs of BT1 protein possessed the 
ability in transporting ADP-GIc. For example, the barley lys5 
mutant, a homolog of BT1 (HvNSTl), show a reduced capacity 
for ADP-GIc uptake by isolated endosperm amyloplasts [5,11]. 
However, the BT1 proteins in dicots, such as AtBTl in Arabidosis 
and StBT 1 in Solatium tuberosum do not transport ADP-GIc, but 
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instead transport AMP, ADP and ATP in a unidirectional mode 
[12]. ^ 

Maize (Zea mays L.) is one of the most important grown cereals 
in the world. It provides staple food to many populations, as well 
as a major nutrient source for animal feed. In addition, benefitting 
from its unique character such as low pasting temperature and 
slow tendency of retrogradation, maize starch is one of the 
important raw materials for industrial production of food. The 
pasting properties of maize starch will enormously affect fabrica- 
tion property, flavor characteristics and keeping in storage. 
Recently, the RVA profile of starch paste viscosity was widely 
employed to evaluate the quality of cereal crops, because this 
method requires only a small size and the procedure is easy to 
perform [13,14]. Starch gelatinization, one of the most important 
and unique properties, refers to the process of the disruption of 
granular structure causing starch molecules to dissolve in water 
[15]. The gelatinization properties of starch are the most 
important indexes in many food modification including cooking, 
baking and extruding starch-based foods [16]. Although the 
physiological roles on the starch biosynthesis of the maize ZmBTl 
gene has been illustrated, the effect of this gene in the formation of 
maize starch pasting and gelatinization properties reminds 
unknown. Moreover, there is no association analysis between the 
nucleotide polymorphisms of the maize ZmBTl gene and the 
physicochemical properties of maize starch. In this work, we 
analyzed the nucleotide polymorphism of maize ZmBTl locus, 
and investigate the association between the sequence polymor- 
phisms of the maize ZmBTl gene and some starch pasting and 
gelatinization properties. 

Materials and Methods 

Plant materials and sequencing the maize ZmBTl gene 

A total of 80 elite maize inbred lines were used in this study. 
These inbred lines were also the representative lines, including 
temperate germplasm from 5 heterotic groups, tropic and waxy 
germplasm (Table 1). They represented most of the genetic 
diversity available to breeding and research programs in China. 
In addition, some germplasm introduced from other countries 
were also included in this study. The inbred lines were grown in 
two-row plots with an randomized block design of two repetitions 
in a natural environment during 2012 in Sanya, Hainan province. 
The mature seeds for each inbred lines were harvested in bulk for 
phenotypic data analysis. 

Young plant leaves were collected at the four-leaf stage for each 
accession and stored at — 80°C until genomic DNA extraction. 
Genomic DNA was extracted from the frozen young leaves of the 
80 inbred lines using CTAB (cetyl trimethyl ammonium bromide) 
method [17] according to the modified protocol. The sequences of 
the ZmBTl gene in the tested inbred lines were sequenced using 
the target sequence capture sequencing technology on the 
NimbleGen platform [18] by BGI Life Tech Co., Ltd.. The 
genomic sequence and position of the maize ZmBTl gene 
(GRMZM2G144081) of the inbred line B73 were used as the 
reference sequences for target sequence capture. 

Measurement of maize starch pasting and gelatinization 
properties 

The pasting properties were measured using a rapid visco 
analyser (RVA) (Model No. RVA-3D, Newport Scientific, Sydney, 
Australia). A total of 3-g starch from each inbred line was 
dispersed in 25 ml of distilled water in the viscometer test canister. 
The sequential temperature curve for a 12.5 min test was as 
follows: (1) incubate at 50°C for 1.0 min; (2) increase to 95°C; (3) 



keep at 95°C for 2.5 min; (4) cool down to 50°C; and (5) hold at 
50°C for 1.4 min. The viscosity was evaluated using a constant 
paddle rotation of 160 rpm. Viscosity values were recorded in 
centipose (cp). 

The gelatinization properties of mazie starches were analyzed 
using a differential scanning calorimeter DSC 200F3 Maia 
(Netzsch, Germany). Starch samples (5 mg, dried starch basis) 
were precisely weighed in the sample pans, mixed with distilled 
water (10 JJ.1), and sealed. The heating rate was at 10°C per min 
over the temperature range of 20-100°C. The gelatinization 
properties were recorded with a thermal analysis data station 
equipped in DSC. 

Sequence analysis 

Multiple sequence alignment of the maize ZmBTl gene was 
performed using Clustal X and was further edited manually. The 
software DNASP 5.0 [19,20] was used to analyze sequence 
nucleotide polymorphism and allelic diversities. Two parameters 
of nucleotide diversity, it and 6 were estimated. Where n is the 
average number of nucleotide differences per site between any two 
DNA sequences, and 8 is derived from the total number of 
segregating sites and corrected for sampling size. Tajima's D [21] 
and Fu and Li's D* and F* [22] statistical tests were used to test the 
evidence of neutral evolution within the selected population and 
each defined region. The minimum number of recombination 
events [23] was estimated in the period of evolution of ZmBTl 
gene among the tested inbred lines. The linkage disequilibrium 
(LD) between any two polymorphic sites were estimated using 
TASSEL v3.0 [24]. In addition, the decay of LD with physical 
distance in ZmBT gene was evaluated by regression analysis 
(PROC NLIN and REG in SAS software). The regression models, 
including linear, loglinear, exponential, power and Remington's 
models [25], were used in this study. 

Population structure and association analysis 

Population structure is a major bias factor leading to false- 
positive associations. To alleviate the effect of population structure, 
all inbred lines were genotyped with the SNP chips contained 
3,072 random SNP markers evenly covering the maize genome. 
These SNP markers were selected from 49,585 SNP markers used 
by recently reported chips [26]. SNP genotyping was performed 
via the GoldenGate assay at the National Maize Improvement 
Centre of China, China Agricultural University. The population 
structure was evaluated by these SNP markers, and the resulting 
Q_-values were obtained from the STRUCTURE program [27]. 
Five independent runs were performed setting the number of 
populations (k) from 2 to 8, burn in time and MCMC (Markov 
Chain Monte Carlo) replication number both to 100,000, and a 
model for admixture and correlated allele frequencies. The k value 
was determined by LnP(D) in STRUCTURE output and an ad 
hoc statistic Ak based on the rate of change in LnP(D) between 
successive k. The tests of significant association between the 
sequence polymorphisms with Minor Allele Frequency (MAF) 
>0.05 and starch pasting and gelatinization properties in the 
tested population were performed using the general linear model 
(GLM) in the TASSEL software v3.0 [24]. 

Results 

Nucleotide diversity and selection of the maize ZmBTl 
gene 

The position and nucleotide sequences of the maize ZmBTl 
gene in inbred line B73, whose genome has been fully sequenced, 
were used as the references to capture of the sequences of this gene 



PLOS ONE | www.plosone.org 



2 



August 2014 | Volume 9 | Issue 8 | e103627 



Diversity and Association Analysis of the Maize ZmBTI Gene 



Table 1. List of the 80 maize inbred lines used in this study. 



No. 


Inbred Line 


Origin and Pedigree 


No. 


Inbred Line 


Origin and Pedigree 


1 


QH19612 


Derived from Huangzaosi 


41 


6819 


Derived from P78599 


2 


Chang7-2 


(Huangzaosi xWei95) xS901 


42 


Dan988 


Derived from P78599 


3 


LX9801 


Ye502xH21 


43 


319B 


Derived from P78599 


4 


107 


DekalbxL80 


44 


Qi319 


Derived from P78599 


5 


k12 


Huangzaosi xHuaichun 


45 


Qi318 


Derived from P78599 


6 


nx335 


NF358xPH4CV 


46 


Shen137 


Derived from P6JK1 1 1 


7 


Ji853 


(Huangzaosi xZi330) xHuangzaosi 


47 


11099 


Tropical germplasm 


8 


412 


Mo17xJing09 


48 


suwan 


Tropical germplasm 


9 


Za107 


DekalbxL80 


49 


11118 


Tropical germplasm 


10 


Huangzaosi 


Derived from Sipingtou 


50 


11200 


Tropical germplasm 


11 


502 


Huangzaosi xDan340 


51 


10533-1 


Tropical germplasm 


12 


Luyuan92 


Yuanqi122x1137 


52 


RCML15 


Tropical germplasm 


13 


10168 


Derived from 5003 


53 


RBS11 


Tropical germplasm 


14 


Dan598 


(Dan340xDanhuang1 1) x(Danhuang02 x599) 


54 


Q52 


Derived from an USA crossbreed 


15 


Zong3 


Zi330xLv28 


55 


KWS456 


Derived from KWS 


16 


4CV 


Derived from Mo17 


56 


QF-11 


Unknown 


17 


E28 


(A619HH xLvjiukuan) xLvjiukuan 


57 


QF01 


Derived from French germplasm 


18 


S122 


(YeH201 xDan340) xDan340 


58 


JND-1 


Derived from a Canadian germplasm 


19 


Dan340 


Baigulv9xPod corn 


59 


BJ-2 


Derived from Mo17 


20 


Danhuang25 


Derived from P78599 


60 


QF02 


Derived from French germplasm 


21 


JH3372 


Shen5003xZi330 


61 


QDM01 


Derived from KWS 


22 


Dan99 


Derived from Lvdahonggu35 


62 


Ml 


Unknown 


23 


Qi232 


Unknown 


63 


WT262 


Derived from KWS 


24 


OH43 


OH40bxW8 


64 


WT26 


Derived from KWS 


25 


M017 


C103x187-2 


65 


Y53 


Unknown 


26 


8112 


P3382xP3147 


66 


BJ-5 


Derived from Mo17 


27 


K8112 


P3382xP3147 


67 


QZN01F 


Waxy germplasm 


28 


Wu314 


(302DxHuangbaoli) xHuangzaosi 


68 


QKN01M 


Waxy germplasm 


29 


4866 


7922xYe478 


69 


QDT01 


Waxy germplasm 


30 


3189 


U8112xShen5003 


70 


QLY01Z 


Waxy germplasm 


31 


Tie9206 


Tie8706xTie8708 


71 


QJKN20F 


Waxy germplasm 


32 


Benyu15 


Unknown 


72 


QZN12 


Waxy germplasm 


33 


478Xuan 


U8112xShen5003 


73 


QKN01F 


Waxy germplasm 


34 


Zheng58 


Derived from Ye478 


74 


QBN48 


Waxy germplasm 


35 


7922 


Derived from P3382 


75 


QBN029 


Waxy germplasm 


36 


178 


P78599xMo17 


76 


BEM 


Waxy germplasm 


37 


QP1721 


Derived from P78599 


77 


QBN02 


Waxy germplasm 


38 


Exhan 


Derived from P78599 


78 


QBN3186 


Waxy germplasm 


39 


xy35 


Derived from Xianyu335 


79 


M1132 


Waxy germplasm 


40 


P138 


Derived from P78599 


80 


QZN012 


Waxy germplasm 



doi:10.1371/joumal.pone.0103627.t001 



in 80 inbred lines. Sequence polymorphisms were detected among 
80 maize inbred lines across 2,442 bp of sequence, which covers a 
520 bp 5' upstream promoter region, a 624 bp exon_l region, a 
131 bp intron_l region, a 162 bp exon_2 region, a 128 bp 
intron_2 region, a 534 bp exon_3 region and a 337 bp 3'-UTR 
region. Nucleotide substitutions and indels at the ZmBTI locus 
were identified, and the results were summarized in Table S 1 and 
Table 2. From the putative genomic sequences in 80 maize inbred 
lines, a total of 30 variants were identified, including 22 SNP sites 



and 8 indels. Among all the SNP sites, only one belongs to 
singleton variable site, while the other 2 1 sites belong to parsimony 
informative sites (Tables 2 and SI). In addition, 2 indels were 
found to be singleton variations, while the other 6 indels belonged 
to parsimony variations. For all the 80 inbred lines, the overall 
nucleotide diversity (n) of ZmBTI locus was 0.00351. However, we 
also noticed that the polymorphic sites were unevenly distributed 
among 7 defined regions of maize ZmBTI locus. There is no 
nucleotide substitution in the regions of intronl, exon2 and 
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intron2. In addition, no indel was found in exon2, while all the 
other regions possessed at least one indel. 

The Tajima's D statistic is a widely used test to identify 
sequences which do not fit the neutral theory model at equilibrium 
between mutation and genetic drift. In this analysis, the estimates 
of Tajima's D in the regions of promoter and exonl were both 
statistically higher than 0 at the level of 0.01. In addition, we also 
noticed that the Tajima's D statistic for the entire region of the 
maize ZmBTI gene was statistically higher than zero. Further- 
more, when we combined all three exons, the estimate of Tajima's 
D was 3.23599, which was also statistically significant at the level 
of 0.01. These results revealed that low levels of both low and high 
frequency polymorphisms in maize ZmBTI locus, and also 
indicated a decrease in population size and/ or balancing selection. 
In addition, the estimates of Fu and Li's F* for both coding 
(2.3416) and entire regions were significant for the ZmBTI gene, 
also suggesting balancing selection on this gene. 

Haplotype diversity of the maize ZmBTI gene 

According to the full length of the ZmBTI gene in the tested 80 
maize inbred lines, a total of 1 1 haplotypes were detected with a 
haplotype diversity (Hd) equal to 0.7734 (Table 3). The tested 
inbred lines were unbalancedly distributed in these haplotypes. 
Among the haplotypes identified in this analysis, 6 contained only 
one inbred line. The most frequent haplotype was Hap_l, which 
contained 29 inbred lines. In addition, we also noticed that four 
frequent haplotypes, including Hap_l^, contained 90% of the 
tested inbred lines. 

In the coding region of the maize ZmBTI gene, a total of 9 
SNPs were detected. In addition, 2 indels were also identified in 
the coding regions. When we used the coding sequences to identify 
the hapotype diversity, a total of 6 haplotypes were identified for 
these 80 inbred lines (Table 3) with a Hd equal to 0.7440. Each of 
the haplotypes defined by coding sequences of ZmBTI gene 
contained at least 2 inbred lines. The most frequent CDS 
haplotype was CDS_Hap_l, which contained 32 inbred lines. 

Among the SNPs detected in the coding regions, only one 
belonged to the nonsynonymous site which could cause the 
replacement change of amino acid sequences. In addition, the 
variation of 2 indels could also result in the change of ZmBTI 
protein. Both of these two indels covered 3 or 6 nucleotides, which 
will not result in frame shift during translation. The indel7 
contained three types, including no deletions, 3 and 6 nucleotides 
deletions. When we translated the CDS into amino acid 
sequences, 6 types of ZmBTI protein sequences were found to 
be encoded by these inbred lines (Fig. 1). Three evolutionarily 
conserved mitochondrial carrier protein domains (Mito_carr, 
PF00153) were detected in maize ZmBTI protein using the tool 
of Pfam. However, None of the four variants of amino acid caused 
by two indels and one nonsynonymous SNP was located in the 
regions of these three domains. 

Linkage disequilibrium and recombination events 

Linkage disequilibrium was investigated between pairwise 
segregating sites in order to predict the expected resolution and 
marker density needed for candidate-gene association mapping. In 
this analysis, all the SNPs identified in maize ZmBTI gene and the 
values of r 2 were used and the result revealed that more than half 
of the pairs between any two polymorphic sites of maize ZmBTI 
gene (130 out of 231 for the tested LD are significant at 
P< 0.0001) showed significant linkage disequilibrium (LD). To test 
the decay of LD with increasing physical distance, some regression 
equations, including linear, loglinear, exponential, power and 
Remington's models, were estimated. In this analysis, the linear 
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Table 3. The distribution of haplotypes of ZmBTI gene in 80 inbred lines using both the full-length sequence and coding regions. 




Haplotype 


Number 


CDS haplotype 


Inbred line 


Hap_1 


29 


CDS_Hap_l 


Wu314, Dan988, QH19612, Exhan, Dan99, QZN12, Ml 132, 502, QJ319, QF01, QBN029, JND-1, 
QDT01, QKN01F, P138, Luyuan92, 178, Shen137, 6819, Chang7-2, QDM01, WT262, LX9801, 107, 
JH3372, 11200, BJ-5, Danhuang25, 412 


Hap_2 


16 


CD5_Hap_2 


nx335, RBS1 1, QJKN20F, xy35. Ml, 1 1099, 7922, Tie9206, SI 22, QF02, Y53, 4CV, WT26, Mo17, BJ- 
2, BEM 


Hap_3 


18 


CDS_Hap_3 


QLY01Z, RCML15, QBN02, E28, Q52, OJ232, K81 12, Benyu15, 81 12, QZN01 F, 1 1 1 18, 4866, Za107, 
Dan340, 319B, QF-1 1, Zong3, Zheng58 


Hap_4 


9 


CD5_Hap_4 


QZN012, QBN48, KWS456, QBN3186, 10533-1, 3189, Qi318, 10168, 478Xuan 


Hap_5 


1 


CDS_Hap_1 


OH43 


Hap_6 


1 


CD5_Hap_5 


QP1721 


Hap_7 


2 


CDS_Hap_6 


QKN01M, Dan598 


Hap_8 


1 


CDS_Hap_1 


Huangzaosi 


Hap_9 


1 


CDS_Hap_5 


k12 


Hap_10 


1 


CDS_Hap_1 


suwan 


Hap_1 1 


1 


CDS_Hap_6 


Ji853 


doi:1 0.1 371 /joumal.pone.01 03627.t003 



regression model was selected to fit the data, because this model 
possessed the highest coefficient of determination. Our result 
revealed that the LD decayed rapidly with increasing physical 
distance. The predicted value of r 2 declined to 0.1 within 2184 bp 
at ZmBTI locus (Fig. 2). 

The polymorphic sites in the entire Zmisa2 locus were used to 
detect the evidence of recombination. The patterns of the 
polymorphisms identified in inbred lines surveyed in this study 
indicated the history of recombination at ZmBTI locus, which 
contributed to the haplotype diversity and the decay of LD. 
However, only one recombination event has been detected 



according to the algorithm of Hudson and Kaplan for minimum 
number of recombination events, and the recombination has been 
detected between sites 979-1793 bp. 

The phenotypic variations and association analysis 

Pasting properties of various corn starches measured by RVA, 
including PV, TV, BD, FV, SB, PT and PTP have been 
summarized in Table 4. The gelatinization temperatures (onset, 
T 0 ; peak, T p ; and conclusion, T c ) and enthalpy of gelatinization 
(AH), for maize starches from different inbred lines, measured 



CDS 


_Hap_ 


1 


CDS 


Hap 


2 


CDS 


Hap 


3 


CDS 


Hap_ 


4 


CDS 


Hap_ 


5 


CDS_ 


_Hap_ 


6 


CDS_ 


_Hap_ 


: 


CDS_ 


_Hap_ 


2 


CDS_ 


_Hap_ 


3 


CDS_ 


_Hap_ 


4 


CDS_ 


_Hap_ 


5 


CDS 


Hap 


6 



CDS_ 


_Hap_ 


1 


CDS_ 


]Hap" 


2 


CDS_ 


_Hap_ 


3 


CDS_ 


_Hap_ 


4 


CDS_ 


_Hap_ 


5 


CDS_ 


_Hap_ 


e 



CDS 


Hap 


1 


CDS 


Hap_ 


2 


CDS_ 


_Hap_ 


3 


CDS 


_Hap_ 


4 


CDS_ 


_Hap_ 


5 


CDS_ 


Hap 


6 



* 20 * 40 * 60 * SO * 100 * 
MAflTMAVTTMVTBSKESWSSLQVPSV.AFPWKPRGGGKTGGLEFPRRAMFASVGLNVCPGVPAGRDPREPDPKVVRSADNCDiaBQLGflAVPGGG^SWEAEEEflAKRRKCK : 110 
MfiflTMAVTTMVTRSKESWSSLCVPRVAFPWKPRGG-KTGGLEFPRRAMFflSVGLNVCPGVPAGRDPREPDPKVVRflADNCDISRQLGSfiVPGgCAAWEAEEEASKRRKCK : 109 
MAATMAVTTMVTRSKESWSSLQVPAVAFPWKPRGG-KTGGLEFPRRAMFASVGLNVCPGVPAGRDPREPDPKWRAADNCDIARQLGAAVPGQQAAWEAEEEAAKRRKQK : 109 
MAATMAVTTMVTRSKESWSS1QVPAVAFPMKPRGGGKTGGLEFPRRAMFASVGLNVCPGVPAGRDPREPDPKVVRAADNCDIARQLGAAVPGQQAAWEAEEEAAKRRKCK : 110 
MAATMAVTTMVTRSKESWESLCVPAVAFPSlKPRGG-KTGGIEFPRRAMFASVGLNVCPGVPAGRDPREPDPKVVRAJiDnCDIARQLGAAVPGQQAAWEAEEEAAKRRKCK : 109 
MAATMAVTTMVTRSKESMSS1QVPAVAFPMKPRGG-KTGGLEFPRRAMFASVGLNVCPGVPAGRDPREPDPKVVRAADNCDIARCJLGAAVPGQQAAMEAEEEAAKRRKCK : 109 

120 * 140 * 160 * 1B0 * 200 * 220 

GGGSKKQQCLGDLSLRKVRVKIANPHLRRLVSGAIAGAVSRTFVAPLETIRTHLMVGEIGVDSMAGVFCWIMCJNEGWTGIFRGNAVHVLRVAPSKAIEHFTYDTAKKFLT : 22 0 

GGGSKKQQCLGDLSLRKVRVKIANPHLRRLVSGftlAGAVSRTFVAPLETIRTHLMVGSIGVDSMAGVFCWIMQNEGWTGLFRGNAVNVLRVAPSKAIEHFTYDTAKKFLT : 219 

GGGSKKQQQLGDLSLRKVRVKIANPHLRRLVSGAIJiGAVSRTFVAPLETIRTHLUVGSIGVBSMAGVFQWIMaNEGWTGLFRGNAVNVLRVAPSKAIEHFTYDTAKKFLT : 219 

GGGSKKCCCLGBLSLRKVRVKIANPHLRRLVSGAIAGAVSRTFVAPLETIRTHLMVGSIGVDSMAGVFOWIMCNEGWTGLFRGNAVNVLRVAPSKAIEHFTYDTAKKFLT : 22 0 

GGGSKKQ5CLGDLSLRKVR"VKIANPHLRRLVSGflIAGAVSRTFVftPLETIRTHLMVGSIGVDSMAGVFCWIMQNEGWTGLFRGNAVHVLR"VSPSKAIEHFTYDTAKKFLT : 219 

GGGSKKGCCLGDLSLRKVRVKIANPHLRRLVSGAIAGAVSRTFVAPLETIRTHLMVGSIGVDSMAGVFCWIMGNEGWTGLFRGNAVNVLRVAPSRA1EHFTYDTAKKFLT : 219 
Mito_carr Domain 1 

* 240 * 260 * 280 * 300 * 320 * 
PKGDEPPKIPIPTPLVAGALAGFASTLCTYPMELIKTRVTIEKDVYDNVAHAFVKILRDEGPSELYRGLTPSLIGVVPYAACNFYAYETLKRLYRRATGRRPGADVGPVA : 330 
PKGDEPPKIPIPTPLVAGALAGFASTLCTYPNELIKTRVTIEKDVYDNVAHAFVEILRDEGPSELYRGLTPSLIGVVPYAACNEYAYETLKRLYRRATGRRPGADVGPVA : 32 9 
PKGDEPPKIPIPTPLVAGALAGFASTLCTYPMELIKTRVTIERDVYDNVAHAFVKILRDEGPSELYRGLTPSLIGVVPYAACNFYAYETLKRLYRRATGRRPGADVGPVA : 32 9 
PKGDEPPKIPIPTPLVAGALAGFASTLCTYPMELIKTRVTIEKDVYDNVAHAFVKILRDEGPSELYRGLTPSLIGVVPYAACNFYAYETLKRLYRRATGRRPGADVGPVA : 330 
PKGDEPPKIPI PTPLVAGALAGFASTLCTYPMELIKTRVTIEKDVYDNVAHAFVKILRDEGPSELYRGLTPSLIGVVPYAACNFYAYETLKRLYRRATGRRPGADVGPVA : 32 9 
PKGDEPPKIPIPTPLVAGALAGFASTLCTYPMELIKTRVTIEKDVYDNVAHAFVKILRDEGPSELYRGLTPSLIGVVPYAACNFYAYETLKRLYRRATGRRPGADVGPVA : 32 9 

Mito carr Domain 2 

340 * 360 * 3B0 * 400 * 420 * 

TLI,IGSAAGAIASSATFPLEVARKaMQVGAVGGRQVYCNVLHAIYCILKKEGAGGLYRGLGPSCIKLMPAAGIAFMCYEACKKILVDEEEDFEEEBGAGGGEDDKKKVE : 439 

TLLIGSAAGAIASSATFPLEVARKQMQVGAVGGRCVYQNVLHAIYCILKKEGAGGLYRGLGPSCIKLMPAAGIAFMCYEACKKILVDKEEDEEEEDEAGGGEDDKKKVE : 438 

TLLIGSAAGAIASSATFPLEVARKQMQVGAVGGRQVYCHVLHAIYCILKKEGAGGLYRGLGPSCIKLMPAAGIAFMCYEACKKILVDKE-DEEEEDEAGGGEDDKKKVE : 437 

TLI.IGSAAGAIASSATFPLEVARKQMQVGAVGGRQVYCNVLHAIYCILKKEGAGGLYRGLGPSCIKLMPAAGIAFNCYEACKKILVDKE-DEEEEEEAGGGEDDKKKVE : 438 

TLLIGSAAGAIASSATFPLEVARKQMQVGAVGGRQVYCNVLHAIYCILKKEGAGGLYRGLGPSCIKLMPAAGIAEMCYEACKKILVDKEEDEEEEEGAGGGEDEKKKVE : 438 

TLLIGSAAGAIASSATFPLEVARKCMQVGAVGGRCVYSNVLHAIYCILKKEGAGGLYRGLGPSCIKLMPAAGIAFMCYEACKKILVDKE EEEEDEAGGGEDDKKKVE : 436 

Mito carr Domain 3 



Figure 1. Sequence alignment of maize ZmBTI proteins encoded by different CDS haplotypes. The haplotypes defined by the coding 
sequences of the maize ZmBTI gene were used as the sequence names. Polymorphisms from inferred amino acids were indicated by boxes. Three 
mitochondrial carrier protein domains were indexed by lines. 
doi:10.1371/journal.pone.0103627.g001 
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Figure 2. LD patterns across the whole locus of ZmBTI. (A) LD between pairs of ZmBTI sequence polymorphic sites. (B) Decay of LD between 

pairs of ZmBTI sequence informative polymorphisms. The linear regression coefficient b is —0.00029. 

doi:10.1371/journal.pone.0103627.g002 
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using DSC are also presented. Significant difference in all the 
pasting and gelatinization properties among different maize inbred 
lines was observed through one-way ANOVA. These results 
suggest that the 80 inbred lines used in this study are 



representative in terms of maize quality and are qualified for 
association analysis. 

To explore the relationship among 1 1 starch pasting and 
gelatinization properties, the pairwise correlation analysis was 
performed, and the Pearson correlation coefficients (r) between 
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any two parameters were obtained (Table 5). Interestingly, the 
significant correlations were found between any two pasting 
parameters, and only 5 pairwise correlations, including PT/PV, 
PT/TV, PT/FV, PT/SB and SB/PTP did not reach the 
significant level. Among the 6 pairwise correlations for gelatini- 
zation properties, only the pairs of AH /T a and AH / T c showed no 
significance. In addition, the correlations between the pasting and 
gelatinization properties were also investigated, and the results 
revealed that AH showed significant correlations with PV, TV, 
BD, PT and PTP, T c with PV and BD, and T p with BD. These 
results suggested that potentially different genetic mechanisms 
were responsible for these starch viscosity properties. 

GLM of association analysis that controlled the effects of 
population structure was used to identify relative association of 1 1 
starch pasting and gelatinization properties and genotype variants 
in maize ZmBTl gene. All nucleotide polymorphisms, including 
SNPs and indels, with a frequency of more than 0.05 of the rare 
alleles were considered in the association analysis of phenotype- 
genotype in both genes. Only one variant (indel7 in exon2) in 
maize ZmBTl gene showed significant association with AH, while 
all the other variants had no association with starch pasting and 
gelatinization properties. Indel7 can cause a deletion of glutamic 
acid (E) in 27 inbreds or a deletion of glutamic and asparagic acids 
(ED) in 3 inbreds. Because the latter possessed a frequency of 
lower than 0.05 in the tested population, it was not used in the 
association analysis. According to the result of association analysis, 
indel7 explained 9.26% of the phenotypic variant of starch AH. 
The mean value of AH was 6.091 with a standard deviation 1.087 
for the alleles carrying the deletion of glutamic acid (E) in protein 
product, and this value was statistically lower than those without 
deletion (6.449 + 0.766) based on independent samples t test 
(P<0.05). 

Discussion 

The abundant genetic variations enable plant breeders to create 
novel plant gene combinations and select crop varieties more 
suited to the needs of diverse agricultural systems. The analysis of 
the genetic diversity for crop functional genes is critical for 
understanding the genetic background of phenotypic variation, 
and in turn will provide great help for crop improvement [28,29]. 
In this study, 30 variants, including 22 SNPs and 8 indels, were 
identified in the full-length sequence of the maize ZmBTl gene. 
Among these SNPs, 9 were found in the coding region, one of 
which were nonsynonymous and the others were synonymous. In 
addition, there were two indels in the coding region of this gene. 
The nonsynonymous SNPs and indels in the coding region would 
result in the changes of protein product. The SNP sites and indels 
in the coding region also classified the tested inbred lines into 6 
haplotypes, which encode 6 deferring ZmBTl proteins. However, 
lower frequency of variant was found in the intron regions of this 
gene. Particularly, none of the SNPs was identified in two introns 
of this gene. This may be the result of that the intron regions in 
this gene are much shorter than the coding region. 

In this study, significantly positive statistics were obtained for 
promoter, coding and entire regions of the maize ZmBTl gene 
through Tajima's neutrality tests. Thus, a decrease in population 
size and/or balancing selection was suggested for this gene. 
Balancing selection refers to a number of selective processes by 
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